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Abstract. We propose a comprehensive research framework to empirically 
investigate complex visual i nference tasks, support mechanisms (animated 
transition using morphing or vector overlay), and how spatial ability affects 
people's learning and knowledge construction process from Spatially Or- 
dered Treemaps (SOTs) as compared to conventional choropleth maps. 
This effort is inspired by the call of the new I nternational Cartographic As- 
sociation commission on Cognitive Visualization (CogVis), which proposes 
"developing a sound theoretical framework based upon cognition and per- 
ception discipline" (Fabrikant, 2011). The framework aims to explore SOTs 
i n the context of both 'i n- vitro' and 'i n-vi vo' setti ngs. This study is grounded 
in cartography but also conforms to experimental design standards in per- 
ception and cognitive sciences. 

Keywords: Spatially Ordered Treemaps, Geovisualization, Visual Tasks, 
Spatial Ability 



1. Introduction 

We now live in an era of data deluge, where our ability to generate data 
outstrips our ability to analyse it. Organizations collect and have access to 
large amounts of data with the aim of getting some benefit from it. Whilst 
organizations acknowledge the value and i mportance of thei r data, many do 
not know how to make sense of the i nformation or what to do with it (Few, 
2011). Maps have a long tradition of being used to make sense of informati- 
on. However, in this data dense era, conventional methods and traditional 
GI S tools do not support or meet the constant emerging user needs (Andri- 
enko et al., 2007). Spatially Ordered Treemaps (SOTs) are designed to 
address some of these needs (Wood & Dykes, 2008). SOTs are space-filling 
graphics that show hierarchical geographic information in a space efficient 
way, using one rectangle per data item in an iterative manner through the 
hierarchy. Unlike the conventional choropleth maps (usually sized bygeo- 



graphical area), SOTs can be sized according to need - for example by po- 
pulation (number of people living in an area) to result in a spacefilling hie- 
rarchical cartogram (Wood & Dykes, 2008a; 2008b). Although, SOTs have 
been used in various applications eg. in local government (Figure 1} to ma- 
nage and allocate resources (LCC, 2010), we lack empirical evidence on how 
effective the technique is in communicating spatio-temporal data either for 
educational purposes or for knowledge discovery. 

1.1. Theory 

Treemaps are space-filling graphics that show hierarchical data in a space 
efficient way using one rectangle per data item in an iterative manner 
through the hierarchy (Shneiderman, 1992). Most treemap layout algo- 
rithms use one-dimensional ordering of rectangles. With SOTs, two- 
dimensional ordering is used and rectangles corresponding to geographical 
areas are arranged in an approximate spatial layout to produce "a space 
filling cartogram" (Wood & Dykes, 2008a; 2008b) that distorts spatial 
geometry to reflect thematic information. A cartogram is a map that distorts 
spatial geometry to correspond to a theme (Tobler, 2004). In light of these 
spatial distortions, support mechanisms such as morphing, overlay vectors 
and colour have been designed in SOTs to aid the user in understanding the 
topology after transformation (Wood & Dykes, 2008b). 

Research efforts on treemaps can be broadly divided into two: development 
of effective treemap algorithms and evaluation of use of the outputs (Kong 
etal., 2010). But none of these efforts deal with the kinds of complex spatial 
tasks for which maps are so effective (MacEachr en, 1995). 
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Figure L A SOT (left) and a conventional geographical map (right) of Leicester- 
shire showing satisfaction (purple) and dissatisfaction (orange) with public ser- 
vices. Population and hierarchy are emphasized and local dissatisfaction in Hinck- 
ley & Bosworth is more evident in the population sized SOT when compared to the 
geographic map. 



2. Research Framework 



Our framework aims to formally evaluate the effectiveness of treemaps in 
the geographic context for the first time. Our aim is to increase our under- 
standing of graphical perception with regards to SOTs. This will in turn 
inform us on the strengths and limitations of SOTs for communicating and 
exploring geographic information. Table 1 gives an overview and charac- 
terise cs of the study at different stages. The specifi c tasks for stages I a, I b & 
I c are further elaborated in Tables 2, 3 & 4. The tasks for stage 1 1 are dis- 
cussed in section 3. 



Context 


'in-vitro' 


'in-vivo' 


Stages 


Stage la 


Stage lb 


Stage Ic 


Stage II 


Tasks 


Locate 


Compare 


Distribution 


Distribution, 
Interaction 


Data collection 


Online Questionnaires 


Questionnaires, 
Interview & 
Observation 


Data type 


Quantitative 


Qualitative 


Participants' 
characteristics 


Our target population are numerate information seekers 
with experience of using data and graphics 


Experts 


Sample size 


Many 


Few 


Study design 


Between- subject 


Case study 



Table L An overview of the proposed study 

2.1. Stage I: Identifying suitable tasks for SOT 

Under the 'in-vitro' setting, the key question under consideration is: "to 
what extent are people able to interpret data and geography in a SOT as 
they would in choropleth maps". This will be evaluated through spatial 
graphical perception tasks that require regional comparison and the inter- 
pretation of general spatial distributions. Our independent experimental 
factors are map type and task. 

2.1.1. Map type 

I nformationally equivalent stimuli (Figure 2) using the choropleth and SOT 
techniques will be generated. Numeric calculation of distance, angle dis- 
placement and adjacency will be used to establish the degree to which the 
SOT reflects the conventional mapping. Choropleth will be sized by land 
area, whileSOTs will be sized by attribute i.e. population. Characteristics to 
be held constant between the two displays include col our, padding, outlines 
and the depth of hierarchy. The geometry of regions, number of enumerati- 



on units and spatial characteristics of the mapped distribution will be vari- 
ed systematically. Spatial characteristics of the distribution (data) will be 
varied according to established spatial statistics. 




Figure 2. A SOT and equivalent Choropleth map showing the geographic 
hierarchy: the whole area, regions (A,B,C,D) and districts (1,2,3) ■ 



2.1.2. Tasks 

Tasks types are informed by a comprehensive review of the literature (e.g. 
Bertin, 1967; Andrienko & Andrienko, 2006). Tasks have been designed to 
reflect the multiple and complex ways in which maps are used and the hi e- 
rarchal emphasis for which SOTs were proposed. Thus, we separated tasks 
that require participants to only use spatial thinking (Geographical), tasks 
that do not require spatial thinking (Statistical) and tasks requiring both 
forms of (Statistical Geographical) thinking. These latter tasks are those 
for which SOTs were designed, but they rely upon performing the indvi dual 
task types effectively. Each task is further divided into direct or inverse, 
according to (Andrienko & Andrienko, 2006): When reading maps we are 
either interested in the characteristic that correspond to a given target (di- 
rect) or what target corresponds to a given characteristics inverse. Hierar- 
chical tasks(marked {H}) are also integrated in this framework with indivi- 
dual spatial units (districts) considered in the context of groups of such 
units (regions). These tasks demand varying cognitive effort. It is hypothe- 
sized that statistical and statistical geographical tasks may be performed 
more effectively using SOTs- particularly when these involve hierarchy- at 
the expense of some geographi c accuracy. 



Statistical 


Direct 


Find the value of a specific district/region 
What is the value of district A /region 1 ? 
Describe (What is) the value (of population) within region 1 (Hj 


Inverse 


Restricted by a value (high/low) find a district or a region. 

List 5 districts with high values 

List 5 districts with high values within region 1{H} 


Statistical/Geographical 


Direct 


Find districts/regions restricted by value relations then by geogra- 
phical location. 

List 5 districts with high values that are to south of region 1 

List 5 districts with high values that are north of district A and 

within region 1 (Hj 


Inverse 


Find districts/regions restricted by geographical location then by 
value relation 

Find 5 districts that are north of district A that have high values. 
Find 5 districts within the same region as district A and south of 
district B (or C) that have high values (H) 


Table 2. Locate tasks: requiring value estimation for individual spatial units or 
regions. 


Statistical 


Direct 


Find and compare value of a specific district with others. 
Between districts A, B, C & D, which one has the lowest value 
Between regions 1,2, 3 & 4, which one has the highest value? (H) 


Inverse 


Restricted by value (high to low) districts should be ordered in this 
sequence. 

From high to low, arrange districts A, B, C & D in order of overall 
population 

From high to low, arrange regions 1,2,3 & 4 in order of overall 
population (Hj 


Geographical 


Direct 


Judge distance and find the location between districts/regions 
Between districts A, B and C, which district is closer to district D? 
Between districts A, B and C, which district is [either within region 
1 1 within the same region as district D j within a different region to 
district D]?{H} 


Inverse 


A district is restricted by location, then judging by distance to a 
specific district/region 

Between districts A, B, C & D, which district is closest to both 
regions 1 and 2? 

Between districts A, B, C & D, which district are in the same region 
as district 1 and 2?{H} 


Statistical/Geographical 


Direct 


Find districts restricted by value relations then by geographical 
locations. 

Are the districts with high values closer to A or B? 

Are the districts with higher values within region 1 or2?{Hj or 

In which direction are the districts with high values from A? 

Are the districts with high values within the same region as district 

1 or2?{H} 


Inverse 


Find districts restricted by geographical location then by value 
relations 

Are more districts with values like C closer to A or B? 

Are more districts with higher values than C within region A or 

B?{H} 



Table 3. Comparison tasks: requiring geographical (distance) or value estima- 
tion judgment between spatial units or regions. 



Flat : Identify similarity between maps (SOTs or Choropleth) by looking at spatial distribution. 

For example, participants are presented with a SOT, next to it there are 5 choropleths. 



Which map looks most similar to the one presented? 



Table4. Distribution tasks: requiring overall area assessment of the map. 



3. Stage II: Identifying support mechanisms and met- 
aphors for supporting data exploration when using 
SOTs. 

The 'in-vivo' setting in which we will be working involves epidemiologists 
who use maps frequently. The key question under consideration is: "to 
what extent are support mechanisms useful during data exploration and 
interpretation of general spatial distributions?" Cartograms have been 
found to be difficult to interpret without some kind of support (Tobler, 
2004). Similarly, si nee SOTs distort locations and result in new layouts that 
are sensitive to aspect ratio, somewhat arbitrary and non-linear in their 
transformation of space. We intend to evaluate how various support me- 
chanisms may help establish their geography. Firstly, we are considering 
animated transitions (eg. morphing) for comparing one layout to another 
so that geographic and SOT locations can be compared (Heer & Robertson, 
2007). This is because there is some evidence animated transition aids in 
understanding statistical graphics. Secondly we are considering linked 
views with displacement vectors (Wood & Dykes, 2008b) that visually indi- 
cate the topologic distortion: thelength of the vector indicates distance whi- 
I e curvature i ndi cates the di recti on of di storti on from the el ement's ori gi nal 
location. 



4. Investigating the effects of spatial ability on tasks 

Data exploration and map reading is more than a perceptual comparison of 
symbols, it is a cognitive process (MacEachren et al., 1992). There is evi- 
dence that cognitive ski I Is such as spatial ability have an effect on visualiza- 



tion tasks and can be used to identify individual differences (Hegarty & 
Waller, 2006). Identifying individual differences is key because once a 
group is known, tools or educational training can be customized towards 
their needs (Slocum et al., 2001). However, very few studies in cartography 
and geovisualization have studied individual spatial ability differences 
(Wilkening & Fabrikant, 201]). Two forms of spatial ability test, paper fol- 
ding and mental rotation will investigate how subjects' task performance 
(with respect to SOTs) is modulated by their spatial ability to give us in- 
sights into why and for whom this spatial transformation may be effective 
(Fabrikant, 2011). 



5. Work plan 

I n the comi ng months we i ntend to run several experi ments usi ng the tasks 
identified in section 2.12. The target sample is numerate information see- 
kers with experience of using data and graphics who are also potential users 
of SOTs. We are currently recruiting participants having received ethical 
approval to undertake the tests from our institution. 



6. Conclusion 

We propose a research framework for empirically assessing the effectiven- 
ess of SOTs for communicating and exploring geographic information 
through relative complex tasks involving geography, statistics and hie- 
rarchy. The proposed study is grounded in cartography and it is informed 
by experimental design standards from perceptual and cognitive discipli- 
nes. We first approached this research by systematically identifying and 
designing complex perception tasks for which SOTs are designed. We spe- 
cifically focused on tasks that require regional comparison and theinterpre- 
tation of general distributions. Wewill also measure subjects' cognitive abi- 
lities using spatial abilities test such as mental rotation and paper folding 
to val i date their performance. The experi mental setti ng wi 1 1 i nd ude both i n- 
vitro and in-vivo settings. With these studies we hope to increase our 
knowledge of graphical perception and better understand how people make 
inferences and interpret, use and learn the abstract semi -geographic layouts 
produced by the SOT during data exploration for knowledge discovery and 
construction. 
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